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Abstract 

(N 

Given a set P of n planar points, two axes and a real-valued score function /() 
on subsets of P, the Optimal Planar Box problem consists in finding a box (i.e. 
axis-aligned rectangle) H maximizing f{H D P). We consider the case where /() is 
^ ^ monotone decomposable, i.e. there exists a composition function g{) monotone in 

its two arguments such that f{A) — g{f{Ai),f{A2)) for every subset A C P and 
every partition {^1,^2} of A. In this context we propose a solution for the Optimal 
Planar Box problem which performs in the worst case 0{n^ Ig n) score compositions 
and coordinate comparisons, and much less on other classes of instances defined by 
I— I various measures of difhculty. A side result of its own interest is a fully dynamic MCS 

Splay tree data structure supporting insertions and deletions with the dynamic finger 
property, improving upon previous results [Cortes et al., J.Alg. 2009]. 

1 Introduction 

> 

Consider a set P of n planar points, and two axes x and y forming a base of the plane, 
such that the points are in general position (i.e. no pair of points share the same x or y 
fSI coordinate). We say that a real-valued function /() on subsets of P is decomposable [2]|8] if 

^ there exists a composition function gQ such that f{A) = g{f{Ai), f{A2)) for every subset 

O A O P and every partition {^1,^2} of ^. Without loss of generality, we extend /() to 

P such that f{p) = f{{p})- A decomposable function is monotone if the corresponding 
• • composition function g{) is monotone in its two arguments. A box is a rectangle aligned 

to the axes, and given a monotone decomposable function /(), such a box is /()-optimal 
^ if it optimizes /{HCiP). Without loss of generality, we assume that we want to maximize 

/() and that its composition function g() is monotone increasing in its two arguments. 
Given a monotone decomposable function /() well defined for the empty set 0, a point 
p of P is positive if f{p) > f{0). Otherwise, this point p is negative. Observe that if p 
is positive then f{Au{p}) = g{f{A)J{p)) > g{f{A),f{0)) = f{A) by monotonicity of 
g{): hence a point p is positive if and only if f{A U {p}) > f{A) for every subset A G P 
not containing p. A stripe is an area delimited by two lines parallel to the same axis. A 
positive stripe (resp. negative stripe) is one which contains only positive (resp. negative) 
points. A monochromatic stripe is a stripe in which all points have the same sign. 
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Given a set of planar points, a simple example of such monotone decomposable functions 
is counting the number of points contained in the box. Further examples include counting 
the number of blue points; returning the difference between the number of blue points and 
the number of red points contained; returning the number of blue points in the box or 
— oo if it contains some red points; summing the weights of the points contained; taking 
the maximum of the weights of contained points; etc. 

Given a set P of n planar points and a real-valued function /() on subsets of P, the 
Optimal Planar Box problem consists in finding an /()-optimal box. Depending of 
the choice of the function /(), this geometric optimization problem has various practical 
applications, from identifying rectangular areas of interest in astronomical pictures to the 
design of optimal rectangular forest cuts or the analysis of medical radiographics. We 
present various adaptive techniques for the Optimal Planar Box problem: 



In the worst case over instances composed of n points, our algorithm properly gen- 
erahzes Cortes et al.'s solution ^ for the Maximum Weight Box problem, within 
the same complexity of O(n^lgn) score compositions. 

For any 5 G [l..n] and ni,...,ns G [l..n] summing to n, in the worst case over 
instances composed of 6 monochromatic stripes of alternating signs when the points 
are sorted by their y-coordinates, such that the i-th stripe contains rii points, our 
algorithm exec utes 0{6n{l + Ti{ni, . . . ,ns))) C 0{6nlg{5 + 1)) score compositions 
(Theorem 4.1), where 'H{ni, . . . ,ns) = ^i=i(?T-j/?T') lg(ra/f^j) is the usual entropy 
function. 

Assuming the same y-coordinate order, for any A G [0..n^], in the worst case over 
instances where A is the sum of the distances between the insertion positions of the 
consecutive points according to their x-coordinate, our algorithm executes 0(n^(l + 



lg(l-|-A/n)) score compositions (Lemma 5.1 ). Measure A relates to the local insertion 
sort complexity [13^ of the sequence of x-coordinates. It holds A G 0(n-|-lnv), where 
Inv is the number of disordered pairs in the sequence. When the points are grouped 
into 6 monochromatic stripes, this complexity drops to 0{n5{l + lg(l + Inv/n)) 



(Theorem 6.1 ). 



Assuming the same y-coordinate order, for a minimal cover of the same sequence of 
x-coordinates into p < n runs (i.e. contiguous increasing subsequences) of lengths 
n, . . . ,rp, our algorithm executes 0(n^(l + ^.{ri, . . . , rp))) C 0(n^ lg(p + 1)) score 



compositions (Lemma 5.3). When the points can be grouped into 5 monochromatic 
stripes, this complexity decreases to 0{n5{l + ?^(ri, . . . , rp))) C 0{n5\g[p + 1)) 



(Theorem 6.1 again 



In the case where subsets of points are clustered along the diagonal, our algorithm 
reduces to the corresponding sub-instances in linear time via a clever partitioning 



strategy (Theorem 7.7), applying previous techniques on those sub-instances directly 



or indirectly (Theorem 8.3). 



We describe our algorithm progressively as follows: 



1. After describing the existing solutions to more specific problems, we present a solu- 
tion for the Optimal Planar Box problem performing O(nlgn) coordinate com- 
parisons and 0(n^ Ig n) score compositions in the worst case over instances composed 
of n points, in Section [2] 
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We describe a truly dynamic MCS tree data structure, supporting in good amortized 
time the insertion and deletion of points (as opposed to their mere activation, as in 
the original variant described by Cortes et al. |6i) and of sequences of points ordered 
by their coordinates along one axis, such as to support in particular the Dynamic 
Finger property on score compositions and comparisons, in Section [3] 

Observing that an instance composed mostly of positive or negative points is easier 
than a general instance, we describe a technique to detect and take advantage of 
monochromatic stripes, hence taking advantage of the sign distribution of the values 
of /(), in Section |4] 

Observing that the problem can be solved in O(nlgn) coordinate comparisons and 
0{n) score compositions when the points are projected in the same order on both 
axis, we describe in Section[5]a variant of the algorithm which takes advantage of the 
relative insertion positions of the consecutive points into a sorted sequence, by using 
our new dynamic MCS tree data structure. The resulting cost is related to that of 



local insertion sorting 10,13,14 , and we relate it with other measures of disorder 
of permutations, such as the number of inversions and the number of consecutive 
increasing runs 
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5. Observing that sorting the points within each monochromatic stripe can only fur- 
ther reduce the inversions and runs complexity between the orders of the points, 
we combine the two previous techniques into a single algorithm, whose complexity 
improves upon both techniques, in Section [6) 

6. Observing that one dimensional instances can be solved in linear time even when 
plunged in the plane, whereas the techniques described above have quadratic com- 
plexity on such instances, we describe how to decompose instances where the points 
are even only partially aligned into diagonals of blocks, which yields an algorithm 
whose best complexity is linear and degrades smoothly to the previous worst case 
complexity as the points lose their alignment. 

7. Finally, observing that sub-instances which cannot be decomposed into diagonals 
of blocks have a specific "windmill" shape, we show how to take advantage of this 
shape in order to still take advantage of particular instances. 

The proofs of the most technical results are presented in the appendix, following an exact 
(automatic) copy of the statement of the corresponding result. 



2 Optimal Boxes and Related Problems 

Given a set P of n weighted planar points, in which the weight of a point can be either 
positive or negative, the Maximum Weight Box problem [g] consists in finding a box 
R maximizing the sum of the weights of the points in Rn P. Cortes et al. [6] gave an 
algorithm solving this problem in time O(n^lgn) using 0(n) space, based on MCS trees, 
a data structure supporting in O(lgn) time the dynamic computation of the Maximum- 
Sum Consecutive Subsequence problem [3] (hence the name "MCS"). Their solution 
applies to other similar problems, such as the Maximum Subarray problem |l6] which, 
given a two-dimensional array, consists in finding a subarray maximizing the sum of its 
elements. 
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The Maximum Weight Box problem fcj and, by successive reductions, the Maximum 



SuBARRAY problem [16], the Maximum Box problem [6j[9|[12], and the Maximum Dis- 
crepancy Box problem (6||7] can all be reduced to a finite number of instances of the 
Optimal Planar Box problem by choosing adequate definitions for the score functions 
/() to optimize. 

Cortes et al.'s algorithm fH] first sorts the points by their y-coordinate in O(nlgn) time 
and then traverses the resulting sequence of points pi,p2, ■ ■ - Pn as follows. For each pi, 
it sets an MCS tree (described in more details in Section [3]) with points pi, . . .pn, where 
the key is their x-coordinate Xi, and all have value /(0). It then successively activates 
points pj for j £ [i..n], setting its weight to value f{pj), updating the MCS tree so that 
to compute the optimal box contained between the y-coordinate of pi to that of pj . The 
whole algorithm executes in time O(n^lgn), corresponding to activations in the tree, 
each performed in time O(logn). 



3 Fully Dynamic MCS Trees 



Cortes et al. [6] defined the MCS tree as an index for a fixed sequence S = ,„] of n 

elements, where each element of S has a weight wlxk) £ K, so that whenever a weight 
w{xk) is updated, a consecutive subsequence {xi)i^yi^j.] of S maximizing r] ^(^*) 

obtained (or recomputed) in O(lgn) time. This behavior is dynamic in the sense that it 
allows modification of element weights, yet it is only partially dynamic in the sense that 
it admits neither addition of new elements nor deletion of existing elements. 

Existing dynamic data structures can be easily adapted into a truly dynamic data structure 
with the same functionalities as MCS trees. We start by generalizing Cortes et al.'s 
algorithm and data-structure [6j from mere additive weights to monotone decomposable 
score functions in Lemma 3.1 We further generalize this solution to use an AVL tree [I] 



in Lemma 3.2 and a Splay tree [15| in Lemma 3.3, whose "finger search" property will 



yield the results of Sections [4] and [5] 



Lemma 3.1 Let S he a static sequence of n elements, and /() he a monotone decom- 
posable score function receiving as argument any subsequence of S, defined through the 
activation and deactivation of each element of S. There exists a semi-dynamic data struc- 
ture for maintaining S using linear space that supports the search for an element in O(lgn) 
comparisons; the activation or deactivation of an element in O(lgn) score compositions; 
and f{)-optimal sub range queries in O(lgn) comparisons and score compositions. 

Proof. Consider a monotone decomposable score function /() on sequences (a particular 
case of monotone decomposable score functions on point sets described in Section [T| and 
its corresponding composition function gQ. 

Given a fixed sequence S = (xi)jg[i ,„] of n elements, each element Xi of S is associated 
to a score f{xi) (a more general notion than the weight previously considered, in that 
f{xi, . . . ,Xn) can be computed more generally than by merely the sum of the individual 
scores f{xi)). 

Then the MCS tree data structure consists of a binary balanced tree T with n leaves, 
in which the leaves of T from left to right store the elements xi, X2, . . . , 

of S. The 

term interval is used to denote the first and last indices of a consecutive subsequence of 
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S joint with the score of the corresponding sequence. We denote by [xi,Xr-] the interval 
corresponding to the subsequence (xi)jg[;, and by [x^]. Let denote the empty 

interval whose score is equal to zero, and given intervals h, . . . ,It let max{/i, . . . , It} be 
the interval of maximum score among Ii, . . . ,It. Each node t; of T stores the following 
four maximum-score intervals, where a;;,x;_|_i, . . . ,Xr are, from left to right, the leaves of 
T descendant of v: 

• I{v) = [xi,Xr], the information about the full sequence; 

• L{v) = max{[,T;, xi], [xi,xi^i], . . . , [xi, Xr], 0}, the information about the best prefix 
of the sequence (where L stands for "Left" ) ; 

• R{v) = max{[,T;, Xr], [xi-\-i, X,]; . . . , [xr, Xr], 0}, the information about the best suffix 
of the sequence (where R stands for "Right" ) ; and finally 

• M{v) = max{max{[xii , Xj.'] : I < V < r' < r},0}, the information about the best 
factor of the sequence (where M stands for "Middle" ) . 

Then the maximum-score consecutive subsequence of S is given by the interval M() of the 

root node of T. Given two contiguous intervals [x/, Xfc] and [x^+i, x,.], let [x;, Xfc] + [xfc+i, x,.] 
denote the interval [x; , x^] , of score obtained through the combination of the scores of of 
[x;,Xfc] and [xk+i,Xr\- Consider 1 + = + I = / for every interval /. The key 
observation in the MCS tree is that if we have computed I{vi), L{vi), R{vi), M{vi), 
I{v2), L(v2), R{v2), and M{v2), where nodes vi and V2 arc the left and right children 
of node v, respectively, then I{v), L{v), R{v), and M{v) can be computed through a 
constant number of operations: I{v) = I{vi) + 1(^2), L(v) = meiK{L{vi), g{I{vi), L(v2))}, 
R{v) = ma.x{g{R{vi),I{v2)),R{v2)}, and M{v) = max{M{vi), M{v2), g{R{vi), L{v2))}. 
These observations can be used, whenever the score of an clement x,; of S is modified, to 
update through a constant number of operation each node in the path from the leaf of T 
storing the score of a sequence containing Xj to the root. Since T is a balanced tree this 
corresponds to O(lgn) operations. The MCS tree data structure also supports optimal 
subrange queries through O(lgn) operations, that is, given a range R of sequence S reports 
a subrange R' of R such that the score of S in R' is maximized. □ 

The MCS tree data structure can be converted into a truly dynamic data structure sup- 
porting both insertions and deletions of elements. This data structure can be used to 
index a dynamic sequence S = (xj)jg[i,,„] of n elements so that whenever an element is 
inserted or removed, a consecutive subsequence S' = (xi)jg[;,y] of S optimizing f{S') can 
be (re)computed in O(lgra) score compositions and comparisons. The following lemma 
establishes the property of this data structure, which we call MCS AVL tree. 

Lemma 3.2 Let S be a dynamic sequence of n elements, and /() be a monotone de- 
composable score function receiving as argument any consecutive subsequence of S. There 
exists a fully dynamic data structure for maintaining S using linear space that supports 
the search for an element in O(lgn) comparisons; the update of the weight of a point in 
O(lgn) score compositions, the insertion or deletion of an element in O(lgn) compar- 
isons and score compositions; and fQ-optimal subrange queries in O(lgn) comparisons 
and score compositions. 

Proof. Let S = (xi)jg[i /() the monotone decomposable score function, and g{) its 
composition function. Consider an AVL tree T of n nodes such that for i € [l..n] the i-th 
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node in the in-order traversal of T stores element Xi of S. We now generalize the MCS 
data structure. 

Each node v of T, in which the in-order traversal of the subtree rooted at v reports the 
elements x/, . . . , of 5, is augmented with the same four intervals I{v), L{v), R{v), 
and M{v) used by the MCS data structure. If node v stores the element Xk and if the 
intervals /(fi), L{vi), R{vi), M{vi), I{v2), L{v2), R{v2), and M{v2) of the left (vi) and 
right (^2) children of v have been computed; then I{v), L{v), R{v), and M{v) can be 
computed in constant number of score compositions as follows: 



• I{v) = I{vi) + [Xk]+I{v2) 

• L{v) = max{L(i;i), + [xk] + L{v2)] 

• R{v) = max{R{vi) + [xk] + 1(^2), i?(i'2)} 

• M{v) = max{M{vi),M{v2), R{vi) + [xk] + L{v2)} 



In this computation the value /() of every interval of the form Ii + [x^] + I2 is equal to 
9{9{f i^i) 1 f i^k)) , f {S2)) , where 5i and 5*2 are the subsequences corresponding to Ii and 
I2, respectively. For empty (or null) nodes v, I{v) = L{v) = R{v) = M{v) = 0. This 
states how these intervals are computed for leaves and one-child nodes. We show that the 
computation of L{v) is correct by considering separately the case where L{v) contains x^ 
and the case where it does not. If Xk is not in L{v), then L{v) must consider only the 
elements xi, x;_|_i, . . . , Xk-i, and is thus equal to L{vi). Otherwise, if Xk belongs to L{v), 
then L{v) must have the form I{vi) + [xk] + 12, where I2 is either or an interval of the 
form [xk-f-i, Xj], j €z [k + l..r]. Observe that I{v2) has the same form as I2 and maximizes 
/(). Since g{) is monotone increasing in its first and second arguments, then 

f{l(v,) + [xk]+l2) = g{gif{iivi)),fixk)),f{l2)) 

< g{g{f{l{v^)),f{xk))J{l{v2))) 

= f{I{vi) + [xk]+I{v2)). 

Therefore, I{vi) + [xk] + -^("^^2) is a correct choice for L(v) if Xk belongs to L{v). Similar 
arguments can be given for R{v) and M{v). 

If the elements of S are ordered by some key, then T is a binary search tree on that key, and 
insertions and deletions are performed by using the binary search criterion. Otherwise, if 
elements of S are not ordered, we further augment every node v oi T with the number 
of nodes in the subtree rooted at v in order to perform both insertions and deletions 
by rank js]. In either case, whenever we insert or delete an element we spend O(lgn) 
comparisons. Furthermore, when a leaf node is inserted, or a leaf or one-child node is 
deleted, we perform a bottom-up update of all nodes of T in the path from that node to 
the root. Let v be any of those nodes. If the update at v does not require a rotation, 
then the augmented information of v is computed in 0(1) score compositions from the 
same information of its two children nodes, which are already computed. Otherwise, if 
the update v requires a rotation, once the rotation is performed, we update bottom-up 
the augmented information of the nodes changed in the rotation (they include v and the 
node getting the position of v) from the same information of their new children nodes. 
This is done again in 0(1) score compositions since there are at most three affected nodes 
in any rotation and the augmented information of their new children nodes is already 
computed. The update of v always requires 0(1) score compositions, and then O(lgn) 
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score compositions are used in total at each insertion or deletion. The subsequence S' 
corresponding to the interval M() of the root of T is the subsequence of S that maximizes 
/(), and is updated at each insertion or deletion. 

Optimal subrange queries on T can be performed as follows. Let xi and Xr be the first 
and last elements of the input range R, respectively. Using O(lgn) comparisons we find: 
the node vi storing xi, the node Vr storing x^, and the least common ancestor node vi^^ 
of vi and Vr- Then the intervals Ri, Mi, Lr, and Mr are computed: We first initialize 
Ri := Ml := 0, and after that, for each node v taken bottom- up from vi to the left 
child of vi^r, such that element Xk stored in v belongs to R, we perform the updates 
Rl := max{i?/ + [xk] + I{v2),R{v2)} and Mi := mai^{Mi, M{v2), Ri + [xk] + ^(^'2)} in 
this order, where V2 is the right child of v. Similarly, we initialize Rr := M^ := and 
for each node v taken bottom-up from Vr to the right child of f;^, such that element 
Xk stored in v belongs to R, we do Lr := m.ax{L(vi), I{vi) + [xk] + Lr} and Mr := 
max{M{vi), Mr, R{vi) + [xk] + Lr}, where vi is the left child of v. Finally, the optimal 
subrange of R is equal to max{M;, M^, i?; + [xi^r] + Lr}, where xi^r is the element of S 
stored in vi^r- The whole computation requires O(lgn) score compositions. 

Therefore, the tree T satisfies all conditions of the lemma. □ 



The Splay tree is a self-adjusting binary search tree created by Sleator and Tarjan [15] . 
It supports the basic operations search, insert and delete, all of them called accesses, in 
O(lgn) amortized time. For many sequences of accesses, splay trees perform better than 
other search trees, even when the specific pattern of the sequences are unknown. Among 
other properties of Splay trees, we are particularly interested in the Dynamic Finger 
Property, conjectured by Sleator and Tarjan 15 and proved by Cole et al. [ij: every 
sequence of m accesses on an arbitrary n-node Splay tree costs 0{m + n + Y^^=i ^s{dj + l)) 
rotations where, for j = l..m, the j-th and {j — l)-th accesses are performed on elements 
whose ranks among the elements stored in the Splay tree differ by dj. For j = 0, the j-th 
element is the element stored at the root. It is easy to see that in the MCS AVL tree we 
can replace the underlying AVL tree by a Splay tree, and obtain then the next lemma, 
which describes the MCS Splay tree data structure. 



Lemma 3.3 Let S he a dynamic sequence of n elements and /() he a monotone decom- 
posable function receiving as argument any consecutive suhsequence of S. There exists a 
data structure for maintaining S that uses linear space and supports the search in O(lgn) 
amortized comparisons, the update of the weight of a point in O(lgn) amortized score com- 
positions, and the insertion and deletion of elements in O(lgn) amortized comparisons and 
score compositions. Joint with the insertion or deletion of any element, the consecutive 
subsequence S' of S maximizing f{S') is recomputed. The Dynamic Finger Property is 
also satisfied for each operation (search, insertion and deletion), both for the number of 
comparisons and for the number of score compositions performed. 



Proof. The complexities of the accesses, and also the Dynamic Finger Property, follow 
from the Splay tree properties and from the fact that the augmented information in each 
node can be computed in a constant number of score compositions and comparisons, from 
the corresponding information stored in its children nodes. Since after each rotation the 
augmented information of the affected nodes of the rotation is updated, the consecutive 
subsequence S' of S maximizing /(£"), which is stored at the root, is recomputed. 

As in the case of the MCS AVL tree, updating the weight of an element can be reduced 
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to removing and inserting it with its new weight, so that its support in O(lgn) amortized 
comparisons and score compositions is a simple consequence of the support of insertions 
and deletions. Obviously, in practice the update operator can be implemented much more 
simply but still requires a rebalancing of the tree on the element accessed in order to yield 
the amortized complexity. □ 



4 Taking Advantage of Monochromatic Stripes 



Consider an instance where positive and negative points can be clustered into 6 positive 
and negative stripes along one given axis, of cardinalities m, . . . , ns- On such instances 
one does not need to consider boxes whose borders are in the middle of such stripes: all 
optimal boxes will start at the edge of a stripe; specifically, the top (resp. bottom) of an 
optimal box will align with a positive point at the top (resp. bottom) of a positive stripe. 

This very simple observation not only limits the number of boxes for which we need to 
compute a score, but also it makes it easier to compute the score of each box: adding 
the rii points of the i-th stripe in increasing order of their coordinates in a MCS Splay 
tree of final size n amortizes to 0{n + Yli=i '^i lg("'/^i)) coordinate comparisons and score 
compositions. The reason is that the rij distances dj + 1 of Lemma 3.3 telescope to at 
most n + rii within stripe i, and thus by convexity the cost 0(?^ + X]j=i lg(^j + 1)) upper 
bounded by 

^ra + TOj lg(l + n/uj)^ C O ^ + Y]ni lg{n/ni)^ 

= 0{n{l + n{ni,...,ns))) 
C 0(nlg(<5 + l)) 

Combining this with the fact that the top of an optimal box is aligned with a positive 
point at the top of a positive stripe yields the following result. 



Theorem 4.1 For any 6 E [l..n] and ni,...,ns G [l..n] summing to n, in the worst 
case over instances composed of 5 stripes of alternating signs over an axis such that 
the i-th stripe contains Ui points, there exists an algorithm that finds an f{)-optimal 
box in 0{5n{l + T-L{ni, . . . ,ns))) C 0{6nlg{6 + 1)) score compositions and 0{5n{l + 
'H{ni, . . . ,ns)) + nlgn) C 0{5nlg{6 + 1) + nlgn) coordinate comparisons. 



5 Taking Advantage of Point Alignments 

Running the algorithm outlined in the first paragraph of Section |4] over the MCS Splay tree 
has further consequences. In this section we show how it makes the algorithm adaptive to 
local point alignments. 

Imagine that, once the points pi,P2, ■ ■ ■ ,Pn have been sorted by y-coordinate, they also 
become sorted by x-coordinate. By hypothesis, no pair of points are aligned to any axis, 
so that the second order is a permutation of the first one. Call vr the permutation that 
rearranges the points pi,p2, ■ ■ ■ ,Pn from an ordering sorted by x-coordinates to an ordering 
by y-coordinates. 
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Then, when we insert the points in the MCS Splay tree, the distance from each insertion 
to the previous one is equal to 1, and therefore the overall cost of the algorithm is O(n^) 
score composition (recall the description of Cortes algorithm in Section [2]). 

More generally, assume that the x-coordinate xj of pj falls at position rj in the sorted set 
of previous x-coordinates {xi, X2, • • • , Xj-i}. Then we have di = and dj = \rj — rj_i| for 
j > I, according to Lemma |3.3l Let us define A = X]j=2 l^i ~ ''i-il local insertion 

complexity of the sequence \T3l Note that a simple upper bound is A < Yl]=2 ~ '''"i-il' 
as the latter refers to the final positions of the elements, whereas A refers to the moves 
needed in the current prefix of the permutation. 

The cost of our algorithm using the MCS Splay tree can then be upper bounded as 
follows. When we insert the points in the MCS Splay tree starting from pi, the total cost 
is 0{n + Yl^=i ^sidj + 1)) C 0{n + nlg(l + A/n)) score compositions, by convexity of the 
logarithm and because J2j=i dj + 1 < X + n. A simple upper bound when considering all 
the n passes of the algorithm is obtained as follows. 



Lemma 5.1 Let P be a set of n points in the plane. Let /() he a monotone decomposable 
score function receiving as argument any subset of P. There exists an algorithm that finds 
an fQ-optimal box in 0(n^(l + lg(l + A/n))) score compositions and 0{n^{l + lg(l + 
A/n)) +nlgn) coordinate comparisons, where X < is the local insertion complexity of 
the sequence of x -coordinates of the points sorted by y- coordinates. 



In the worst case this boils down to the 0(n^ Ig n)-worst-case algorithm, whereas in the 
best case A = and the cost corresponds to O(n^) operations. 

We can upper bound A by using other measures of disorder in permutations. For example, 
let us consider Inv, the number of inversions in the permutation vr, or said another way, the 



number of pairs out of order in the sequence 14 . The measure Inv corresponds to a cost 



where the "finger" is always at the end of the sequence. This can be as small as (A — n)/2, 
for example consider the permutation tt = (m, m — 1, m + 1, m — 2, m + 2, . . . , 1, 27n — 1) 
for m = (n + l)/2 and odd n. However, Inv can be much larger than A because it is 
not symmetric on decreasing sequences, for example when the points are semi-aligned in 
a decreasing diagonal and the permutation is tt = (n, n — 1, n — 2, . . . , 1). Thus replacing 



A by Inv in Lemma 5.1 yields a valid upper bound in terms of big-0 complexity. 



Lemma 5.2 lemma:relacionInvLambda For any permutation tt of local insertion complex- 
ity X and presenting Inv inversions, 

Inv > A/2 - n. 



Proof. For each Xj, X increases by ItTj — TTj-i\, whereas Inv increases by j — ttj. Since 
TTj < j, if TTj > TTj-i, we can use that {j — 1) — 7rj_i + 1 > vr^ — vTj-i, or else we use 
j — TTj > TTj-i — TTj. So either the contribution of Xj-i plus 1, or that of Xj, to Inv, upper 
bounds the contribution of Xj to A. By summing up both contributions (that of xj-i 
plus 1, and that of Xj) for each j we have 2 • Inv + n and this upper bounds A. See also 
Estivill-Castro and Wood's survey [lO]. □ 



Another well-known measure of permutation complexity is the number of increasing runs p, 
that is, the minimum number of contiguous monotone increasing subsequences that cover 
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TT 11 . Let n, . . . , be the lengths of the runs, computed in 0{n) comparisons. Then the 
sum of the values {t^j+i —t^j \ within the i-th run telescopes to at most n, and so does the sum 
of the dj values. Therefore Z]j'=i ^s{dj + 1) < Y,i=i lg(l + n/ri) < n + Y,i=i lg("-/n) 
by convexity. This leads to the following alternative upper bound. 



Lemma 5.3 Let P he a set of n points in the plane. Let /() he a monotone decomposable 
function receiving as argument any suhset of P. There exists an algorithm that finds 
an f{)-optimal box in O(nlgn) coordinate comparison and 0(n^(l + 7i{ri, . . . ,rp)) C 
0(n^ lg(p+l)) score compositions, where ri, . . . , are the lengths of p maximal contiguous 
increasing subsequences that cover the sequence of x- coordinates of the points sorted by y- 
coordinate. 



6 Taking Advantage of both Stripes and Alignments 



The combination of the techniques of Sections |4] and [5] can be elegantly analyzed. A simple 
result is that we need to start only from 5 different pi values, and therefore an upper bound 
to our complexity is 0(n5((l+lg(l+A/n))). We can indeed do slightly better by sorting the 
points by increasing x-coordinates within each monochromatic stripe. While the measure 
A' resulting from this reordering may be larger than A, the upper bounds related to Inv 
and p, namely Inv', p' , and 7i{n[, . . . ,n'^,), do not increase. In particular it is easy to see 
that the upper bound of Theorem |4.l| is dominated by the combination since p' < 6 and 
T-l{r[, . . . ,r'p,) < 7i{ni, . . . ,ns) (because no run will cut a monochromatic stripe once the 
latter is reordered). 

Theorem 6.1 Let P be a set of n points in the plane. Let /() be a monotone decom- 
posable function receiving as argument any suhset of P. There exists an algorithm that 
finds an f{)-optimal box in 0(n Ign) coordinate comparisons and 0{n6{l + min(lg(l + 
Inv/n), ?^(ri, . . . , Tp)))) C 0{n5lg{p + 1)) score compositions, where 6 is the minimum 
number of monochromatic stripes in which the points, sorted by increasing y- coordinate, 
can he partitioned; X is the corresponding sequence of x-coordinates once we (re-) sort 
by increasing x-coordinate the points within each monochromatic stripe; Inv < is the 
number of out-of-order pairs in X; and ri, . . . , are the lengths of the minimum number 
P ^ S of contiguous increasing runs that cover X . A similar result holds by exchanging x 
and y axes. 



Note that if these new measures are not particularly favorable, the formula boils down to 
the 0{n5\g5) time complexity of Section El 



7 Taking Advantage of Diagonals of Blocks 



In this section we take advantage of the relative positions of the points to obtain a different 
adaptive algorithm. We will consider partitions of the point set into two subsets. These 
partitions are induced by two lines which are parallel to the axes and perpendicular each 
other. A combination of optimal boxes of each of the subsets will lead to the optimal box 
of the whole point set. 

For any subset ^ C P, a diagonalization of A is a partition {^i, ^2} of A induced by two 
lines ii and £2, respectively parallel to axes x and y, so that the elements of Ai and the 
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elements of A2 belong to opposite quadrants with respect to the point £irii2- Figure [T] gives 
some example of diagonalization. In particular, assuming that the points of Ai are to the 
left of the points of A2, we call the diagonalization bottom-up (Figure [T^) if the elements 
of Ai are all below the elements of A2 and top-down (Figure [Tja) otherwise. Note that if 
Pi,P2t ■ ■ ,Pm denote the elements of A sorted by x-coordinate, then any diagonalization 
of A has the form {{pi, ■ ■ ■ ,Pk}, {Pk+i, ■ ■ ■ ,Pm,}} for some index k G [l..m — 1]. 



• • 

. A2 ' 
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• • 

• 

• 








• • 
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(a) 



(b) 



Figure 1: A diagonalization {Ai, ^2} of the point set A = Ai U 

Given any bounded set S C M'^, let Box(S') denote the smallest enclosing box of S and let 
the extreme points of A be those belonging to the boundary of Box(74). 

Not all point sets can be decomposed into diagonals, the simplest case being a set of four 
points placed at the four corners of a square which sides are slightly rotated from the axes 
X and y. We call such a point set a windmill, for the characteristic position of its extreme 
points. See Figure [2] for an example. 



(a) 



(b) 



Figure 2: Windmills. 



Lemma 7.1 Let A be a point set that does not admit a diagonalization. Then A has 
exactly four extreme points. Furthermore, A has a windmill which contains at least one 
extreme point of A. 

Proof. The first part of the lemma is easy to show. We proceed to prove the second part. 
Let {p, q, r, s} be the extreme points of A. If they form a windmill then we are done. 
Otherwise, assume without loss of generality that their relative positions are as depicted 
in Figure [3^. If there is an element q' of A in region Ri then {p, q', r, s} is a windmill 
and we are done. Analogously, if there is a point s' of A in region R2 then {p, q, r, s'} is a 
windmill and we are done. Assume then that Ri U R2 is empty of elements of A. Since A 
does not admit a diagonalization then i?3Uii4 must contain a point of A. Assume without 
loss of generality that R3 contains a point. Let t be the rightmost point of R^ creating 
regions R^ and Rq as shown in Figure [sjs. If there is a point u in R^ then set {p, q, t, u} is 
a windmill and we are done. Analogously, if there is a point u in Rq then set {p, t, r, u} is 
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Figure 3: Proof of Lemma 7.1 In each figure the shaded area is empty of points of set A. 



a windmill and we are done. Then assume -R5 U Rq is empty of elements of A. Now, since 
A does not admit a diagonalization, region R4 must contain a point of A. Let then t' be 
the bottom- most point of i?4, which induces the regions Rq and Rj depicted in Figure [3};. 
We now proceed with t' , similar as we did with t. If region Ri contains a point u oi A then 
{p, t, u, t'} is a windmill and the result follows. Otherwise, region Rs must contain a point 
u' of A, because in the contrary case A would have a diagonalization. Then {t, u' , s, t'} is 
a windmill. Therefore, the result follows. □ 

Definition 7.2 A diagonalization tree of P, D-tree, is a binary tree such that: (i) each 
leaf u contains a subset S{u) C P which does not admit a diagonalization, (ii) set 
{S{u) \ u is a leaf } is a partition of P, and [Hi) each internal node v has exactly two 
children vi (the left one) and V2 (the right one) and satisfies that {A(vi), A{v2)} is a di- 
agonalization ofA{v), where for each node v A{v) denotes the union of the sets S{u) for 
all leaves u descendant of v (See Figure^. 
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Figure 4: A D-trec of the point set {pi, . . . ,pi3}. 
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Lemma 7.3 Let P be a set of n points in the plane. Every D-tree of P has the same 
number of leaves. Furthermore, the i-th leaves from left to right of any two D-trees of P 
contain the same subset S{-) of P. 



Proof. We use induction on the number of elements of P. If P contains only one element, 
or P does not admit a diagonalization, then we are done. Otherwise, assume that P has 
exactly A; > 1 different diagonalization, all of which must be of the same type (bottom- 
up or top-down). Then P is partitioned into k + 1 non-empty sets denoted from left 
to right Pi, P2, . . . , Pk+i such that the k diagonalizations are Pi,\^i^j^i Pi} for 

j = 1 . . .k. Observe then that no set Pi admits a bottom- up diagonalization and that any 
L>-tree of P can be obtained by building a binary tree, whose leaves from left to right are 
Pi, P2, . . . , Pk+i and each internal node has two children, and replacing every leaf Pi by a 
D-tree of Pi. Applying the induction hypothesis for Pi,P2, . . . , Pk+i the result follows. □ 



From Lemma 7.3 we can conclude that every D-tree T of P induces the same partition of 



P, which is equal to {S{ui), . . . , ^(li^)}, where tti, . . . , are the leaf nodes of T. 

Lemma 7.4 Let P be a set of n points in the plane. A D-tree of P requires 0{n) space 
and can be built in 0{n\gn) comparisons. 

Proof. Let pi,p2, . . . ,Pn^^ the elements of P sorted by x-coordinate, and let p-ki-,P-k2i ■ ■ ■ 1 
p^^ be the elements of P sorted by y-coordinate. Both orders can be obtained in O(nlgn) 
coordinate comparisons. 

Considering the computation of permutation vr as a preprocessing, we now show that: 
If P admits a diagonalization {{pi, . . . ,pk}, {pk+i, ■ ■ ■ ,Pn}} then it can be determined in 
0{mm{k, n — k}) comparisons. Otherwise, if P does not admit a diagonalization, then it 
can be decided in 0{n) comparisons. 

For each index i G [l-.n], let Mi{i) = maxj^^i ^ttj, mL{i) = minj^^i ^ttj, MR{i) = 
maxjg[i.,n] TTj, and mR{i) = minjg[i..„] ttj. 

Observe that if {{pi, . . . ,Pk}, {Pk+i, ■ ■ ■ ,Pn}} is a diagonalization of P, then index k G 
[l..n — 1] satisfies M^^k) = k or mL{k) = n — k + 1. Furthermore, ML{k) = k and 
mL{k) = n — k + 1 are equivalent to mnik + 1) = k + 1 and M]i{k + 1) = n — k, 
respectively. 

Then we can determine a diagonalization of P, if it exists, as follows: For j = l..[n/2j 
decide if {{pi, . . . ,pj},{pj+i, . . . ,pn}} is a diagonalization (i.e. Miij) = j or mi(j) = 
n-j + 1) or {{pi, . . . ,Pn-j}, {pn~j+i, ■ ■ ■ ,Pn}} is a diagonalization (i.e. MR{n-j + l) = j 
or mR{n — j + 1) = n — j + 1). Note that if j > 1 then M^^j), mL{j), MR{n — j -|- 1), 
and mR{n — j + 1) can all be computed in 0(1) comparisons from ML{j — 1), mL{j — 1), 
TTj, Mii{n — j + 2), mR[n — j + 2), and 7r„_j+i. Therefore, if there is a diagonalization 
{{pi, . . . ,Pk},{pk+i-, ■ ■ ■ iPri}} of P it is decided for j = min{A;,n — k} < [n/2j, and 
thus determined in 0{j) comparisons. If no diagonalization is found for any value of 
j G [l..[n/2j], then the algorithm spends 0(n) comparisons in total. 

We can then build a D-tree of P recursively as follows. Run the above algorithm for P. 
If a diagonalization {{pi, ■ . . ,Pk}, {Pk+i, ■ ■ ■ ,Pn}} of P exists, which was determined in 
0{t) comparisons where t = min{A;, n — k}, then create a root node and set as left child a 
D-tree of {pi, . . . ,pk} and as right child a D-tree of {pk+i, ■ ■ ■ ,Pn}- Otherwise, if P does 
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not admit a diagonalization, which was decided in 0{n) comparisons, then create a leaf 
node whose set S{-) is equal to P. This results in the next recurrence equation for the 
total number T{n) of comparisons, where 1 <t < [n/2j: 



W.l.o.g. assume that the constants in 0{t) and 0(n) in the recurrence are equal to one. 
Then we prove by induction that T{n) < n + nlgn. The base case of the induction is the 
second line of the recurrence equation, where n < n + nlgn always holds. In the inductive 
case, we have: 

T(n) = t + T{t) + T{n - t) 

< n + t + tlgt + {n- t)lg{n - t) 

= n + t + tlgt + {n — t) lg{n — t) + {nlgn — tlgn — {n — t) Ign) 
= n + nlgn + t — t\g{n/t) — {n — t) \g{n/{n — t)) 
= n + nlgn + n{t/n — H{t/n)) 

< n + nlgn 

The second line uses the inductive hypothesis. In the third line we add and subtract nlgn 
written in two different forms. In the fourth line we regroup terms. In the fifth line we 
introduce the binary entropy function H{x) = xlg{l/x) + (1 — 2;)lg(l/(l — x)), where 
X = t/n. Finally, in the last line we apply the analytic inequality x < H{x), which holds 
at least for x < 1/2. Thus T{n) < n + nlgn and then T(n) is O(nlgn). One can see that 
this solution is tight by considering the case t = n/2. 

It is easy to see that any D-tree of P requires 0{n) space. The result follows. □ 

Definition 7.5 For any non-empty subset A C. P the set of "ten" f{) -optimal boxes of 
A, denoted by Ten(^), consists of the following f{)-optimal boxes of A, all contained in 
Box{A): 

1. Box(^). 

2. Bopt[A), an fQ-optimal box. 

3. Bi(^), an fQ-optimal box containing the bottom-left vertex o/Box(^). 
4.. 62(^1), an fQ-optimal box containing the bottom-right vertex o/Box(^). 

5. B^{A), an fQ-optimal box containing the top-right vertex o/Box(yl.). 

6. B4^{A), an fQ-optimal box containing the top-left vertex ofBox[A). 

7. Bi^2(^); an fQ-optimal box containing the bottom vertices o/Box(A). 

8. B2,3(^); an fQ-optimal box containing the right vertices o/Box(^). 

9. B3_4(^), an fQ-optimal box containing the top vertices o/Box(^). 
10. B4_i(^), an fQ-optimal box containing the left vertices o/Box(^). 

Lemma 7.6 For any non-empty subset A C. P and any diagonalization {Ai,A2} of A, 
Ten(74) can be computed in 0(1) score compositions from Ten(^i) and Ten(742). 




0{t) + TQt) + T{n - t) 
0(n) 



n > 1, a diagonalization exists 
otherwise. 
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Proof. Suppose without loss of generality that {74i,j42} is bottom-up . Let U2 and U4 
be the bottom-right and top- left vertices of Box{A), respectively. Let maxjiJi,. . . , Ht} 
denote the /()-optimal box for A among the boxes Hi, . . . , Ht. Observe that: 

• Box(^) = Box(yli U A2) = Box(Box(^i) U Box(^2))- 

• Bopt{A) =max{Bopt(^i),Bopt(^2),Box(B3(^i)U 61(^2))}. 

• Bi(^) = max{Bi(yli),Box(Box(Ai) U Bi(A2))}. 

• B2(^) =Box(max{B2(Ai), 62(^2), Box(B2,3(^i)U 61,2(^2))} U{n2}). 

• B3(^) =max{Box(B3(Ai)UBox(^2)), 63(^2)}. 

• B4{A) =Box(max{B4(Ai), 64(^2), Box(B3,4(yli)U 64,1(^2))} U{n4}). 

• Bi,2(^) = max{Box(Bi,2(Ai) U {ti2}), Box(Box(Ai) U 61,2(^2))} 

• B2,3(^) = max{Box(B2,3(^2) U {^2}), Box(Box(.42) U 62,3(^1))} 

• Bs^A) = max{Box(B3,4(A2) U {^4}), Box(Box(^2) U 63,4(^1))} 

• B4,i(A) = max{Box(B4,i(Ai) U {ti4}),Box(Box(^i) U 64,1(^2))} 

Since each of the elements of Ten(^) can be obtained from a constant number of pairwise 
disjoint boxes of Ten(Ai) U Ten(A2), and for any two disjoint boxes H and H' we have 
f{{HUH')nA) = f{{HnA)U{H'nA)) = g{f{HnA),f{H'nA)), the result follows. □ 



Theorem 7.7 Let P be a set of n points in the plane. Let /() he a monotone decom- 
posable function receiving as argument any subset of P. There exists an algorithm that 
finds an fQ-optimal box of P in 0{n\gn + ^^^-^ hc{ni)) comparisons (on coordinates and 
indices) and 0{Yji=i hs{ni) + fi) score compositions, where {Pi, . . . , Pp} is the partition of 
P induced by any D-tree of P and /3 is the size of this partition, Ui is the cardinality of Pi, 
and hc{ni) and hs{ni) are the numbers of coordinate comparisons and score compositions 
used, respectively, to compute the "ten" fO-optimal boxes of P^. 



Proof. Build a D-tree T of P in O(nlgn) comparisons (Lemma [7]). Let ui,...,up be 
the leaves of T which satisfy S{ui) = Pi for all i G [l..n]. Compute the set Ten(5(Mi)) = 
Ten(Pj) in hc{ni) coordinate comparisons and hs{ni) score compositions. By using a post- 
order traversal of T, for each internal node f of T compute Ten{A{v)) from Ten(^(fi)) 
and Ten(^(z;2)), where vi and V2 are the children nodes of v, in 0(1) score compositions 
(Lemma 7.6). The /()-optimal box of P is the box Bopt{A{r)), where r is the root node of T 
and satisfies ^(r) = P. In total, this algorithm runs in 0(nlg n)+^^^-i^ /ic(?T.i) = 0(nlgn+ 

J2i=i hc{ni)) coordinate comparisons and Yli=i f^s{ni) + Yli=i 0{l) = 0(J2i=i hs{n-i)+fi) 
score compositions. □ 



Observe that the best case of the algorithm of Theorem 7.7 is when [^(nj)! is 0(1) for each 
leaf node Ui of the Z)-tree of P. It yields a complexity of 0{n Ig n) coordinate comparisons 
and 0{n) score compositions. The worst case is when P does not admit a diagonalization, 
but even in this case the additional 0(n Ign) coordinate comparisons, performed by the 
algorithm while trying to diagonalize, do not add to the overall asymptotic complexity. 
Hence the worst case complexity of the diagonalization algorithm reduces to the case stud- 
ied in the previous sections, with a complexity of at most 0{n Ig n) coordinate comparisons 
and O(n^lgn) score compositions. 
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8 Dealing with Windmills 



In this section we use Lemma 7.1 to obtain a variant of the algorithm in Theorem 7.7 



The set S{u) of every leaf node u of any D-tree of P does not admit a diagonalization and 
has a windmill containing an extreme point of S{u). The idea is to remove the extreme 
points of S{u) and then recursively build a -D-tree of the remaining points. This approach 
yields a diagonalization in depth of the point set, potentially reducing the number of score 
compositions. 



Definition 8.1 An extended diagonalization tree of P, D*-tree, is defined recursively as 
follows: Each leaf node u of a D-tree of P satisfying \S{u)\ > 1 is replaced by a node u' 
containing the set X{u) of the four extreme points of S{u), and if the set S{u) \ X[u) is 
not empty then u' has as its only one child a D*-tree of S{u) \ X[u). 



Lemma 8.2 Let P he a set of n points in the plane. Every D*-tree of P has the same 
number a of one-child nodes, contains n — 4a leaves nodes, and every leaf node u satisfies 
\S{u)\ = 1 or \S{u)\ = 4. A D*-tree of P requires 0{n) space and can be built in 0(nlgn + 
an) comparisons. 



Proof. The first part of the proof can be seen from Lemma 7.3 and Definition 8.1 A 



D*-tree of P can be built in 0(n Ig n + an) comparisons by following the same algorithm 
to build a D-tree of P until finding a leaf node u such that S{u) does not admit a diag- 
onalization. At this point we pay 0{n) comparisons in order to continue the algorithm 
with the set S{u) \ X{u) according to Definition 8.1 Since this algorithm finds a nodes 
u, the total comparisons are 0(nlgn -|- an). The D*-tree has n nodes of bounded degree 
and hence can be encoded in linear space. □ 



Theorem 8.3 Let P be a set ofn points in the plane. Let /() be a monotone decomposable 
function receiving as argument any subset of P. There exists an algorithm that finds an 
fO -optimal box of P in 0{nlgn -\- an) coordinate comparisons and 0{n -\- anlgn) score 
compositions, where a is the number of one-child nodes of every D*-tree of P. 



Proof. Build a L'*-tree T of P in 0{n\gn an) comparisons (Lemma 8.2). For each 
of the n — Aa leaves nodes n of T compute Teii(S'(u)) in constant score compositions. 
Then, using a post-order traversal of T, compute Ten(S'(M)) for each internal node u as 
follows: If V has two children vi (the left one) and V2 (the right one), then {A{vi) , A[v2)} 
is a diagonalization of A{v) and Ten(^(u)) can be computed in 0(1) score compositions 
from Ten(^(fi)) and Teii{A{v2)) (Lemma 7.6). Otherwise, if v is one of the a one-child 
nodes, then Ten(^(f)) can be computed in O(nlgn) worst-case comparisons and score 
compositions. Namely, if a box of Ten(j4(f)) contains a at least one point of X{u) in 
the boundary then it can be found in O(nlgn) comparisons and score compositions [6]. 
Otherwise, it is a box of Ten(yl(u')), where v' is the child of v. We pay 0(1) score 
compositions for each of the 0{n) two-child nodes and 0(n Ign) score compositions for 
each of the a one-child nodes. Then the total score compositions is 0{n + anlgn). The 
result follows. □ 
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9 Conclusions 



Cortes et al. [6] proposed a solution for the Maximum Weight Box problem based on 
a data structure maintaining the scores of a hierarchy of segments, the MCS Tree. We 
extended this solution in several directions: 

1. we showed how to replace the sum operator by a monotone decomposable function 
/(), so that Cortes et al.'s algorithm |6] can optimize more general score functions 
than merely the sum of the weights of the points; 

2. we extended the MCS tree data structure to make it fully dynamic, supporting the 
insertion and removal of points in time logarithmic in the number of points in the 
structure and in their relative insertion ranks; 

3. we described adaptive techniques to take advantage of various particularities of in- 
stances, such as the clustering of positive and negative points or the relative positions 
of the points, without sacrificing worst case complexity. 

Whereas we showed many techniques to take advantage of particular instances of the 
Optimal Planar Box problem, other types of instances are left to study, such as more 
complex forms of clustering, and a unified analysis of all the adaptive techniques presented. 
Other directions of research are the generalization of the problem to higher dimension, and 
to other shapes than boxes, such as convex polygons. 
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